Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Detect and recover from a bad service node state #2241

Merged
merged 1 commit into from
Jan 28, 2025

Conversation

tewinget
Copy link
Collaborator

@tewinget tewinget commented Oct 3, 2024

If a node is deregistered while running (or is started before being registered), it can fail to properly connect to the network once registered (again). This does not fix that, but these symptoms should be sufficient to detect such a state, and the node should recover upon restart.

@tewinget tewinget force-pushed the stable-fix-reregister branch 3 times, most recently from c5bd948 to 0102f59 Compare October 3, 2024 23:20
@jagerman jagerman modified the milestones: 0.9.12, 0.10.x Oct 10, 2024
If a node is deregistered while running (or is started before being registered),
it can fail to properly connect to the network once registered (again).  This does
not *fix* that, but these symptoms should be sufficient to detect such a state and
the node should recover upon restart.
@jagerman jagerman changed the base branch from dev to stable December 11, 2024 00:50
@tewinget tewinget force-pushed the stable-fix-reregister branch from 0102f59 to c03eed3 Compare December 11, 2024 00:53
@jagerman
Copy link
Member

This has been carried as a patch in the debs for a couple months now, and I recently had a registration that reproduced the bug causing lokinet to get stuck with no peers, which successfully triggered this workaround and a restart that unstuck it without intervention.

@jagerman jagerman merged commit 9690676 into oxen-io:stable Jan 28, 2025
1 check failed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants